Practice Workbook: Excel Basics 
for Data Science 


Section 1: Importing and Exporting 


Concepts 
e Importing data 


e Exporting data 


Exercise 
e Add anew column to the left of temperature column, Fahrenheit. 
e Use formula to compute values from temperature column. 
o Note: Fahrenheit = 1.8C + 32 


e Export the data as a CSV file. 


Dataset 
e Dataset Link: https://bit.ly/CitiesDatasetl 





Section 2: Data Cleaning 


Concepts 


e Data Cleaning Techniques 


Exercise 


e Perform the data cleaning techniques outlined in the given guidelines document and 
save the file for further analysis. Guidelines: https://bit.ly/DataCleaningGuidelines 


Dataset 


e Dataset Link: https://bit.ly/DirtyTitanicDataset 
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Section 3: Business Questions 


Concept 


e Brainstorming 


Exercise 


e Come up with as many questions as possible that can be answered from your clean 
titanic dataset from Section 2. 


Section 4: Data Analysis Part I 


Concepts 
e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / aggregation / 
analysis 
Exercise 
e How many cities in Italy? 
e What is the average latitude: 
o Overall? 
o For cities with temperature < 10 ? 
o For cities with temperature > 10 ? 
o For cities where both the city name and the country name end in the letter “a” ? 


o Which are warmer on average — cities in the EU or cities not in the EU? 


Dataset 





e Dataset Links: https://bit.ly/CitiesDatasetl and https://bit.ly/CountriesDatasetl 





Section 5: Data Analysis Part Il (Titanic) 
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Concepts 


e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / Aggregation / 
Analysis. 


Exercise 


e How many passengers sailed for free? 

e How many married women over age 50 embarked in Cherbourg? 
e How many passenger ages are missing? 

e What is the average fare paid by these passengers? 


e What is the most common last name among passengers, and how many 
passengers have that last name? What is the average number 


e of passengers per last name? 
e What is the average fare paid by passengers in the three classes? 


e What is the average age of passengers in the three classes? 


Dataset 


e Dataset: Your Clean Titanic Dataset 


Section 6: Data Visualisation 


Concepts 


e Bar charts: When one axis is categories and the other is numeric. 
e Pie charts: To compare relative sizes of categories. 


e Scatter plots: When both axes are numeric. 


Exercise 
e How many passengers sailed for free? 


e How many married women over age 50 embarked in Cherbourg? 
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e How many passenger ages are missing? 

e What is the average fare paid by these passengers? 

e What is the most common last name among passengers, and how 

e many passengers have that last name? What is the average number 
e of passengers per last name? 

e What is the average fare paid by passengers in the three classes? 


e What is the average age of passengers in the three classes? 


Dataset 


e Dataset: Your Clean Titanic Dataset 
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